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No. 60/018,282. 

20 

Background of the Invention 

The present invention relates to optimization and, more particularly, to optimization of 

biocatalyst performance. 
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Many industrial processes are based on catalytic reactions. Enzymes that are isolated 
from natural sources have evolved to be efficient and selective catalysts for the chemical 
reactions that take place in living systems. These natural enzymes are often not well suited to 
these tasks due to overproduction or use of an unnatural substrate, poor substrate solubility, 
5 breakdown of unstable products, or competing chemical reactions. In addition, many industrial 
processes involve substrates, organic solvents, and other harsh reaction conditions that are not 
encountered in nature. Several common industrial applications for enzymes, or biocatalysts, 
include serving as catalysts for chemical synthesis, additives for enhancing laundry detergent 
performance, or the use in water treatment plants as bioremediators of potentially carcinogenic or 
ffl) toxic compounds. For industrial applications, enzymes can be engineered to carry out specific 
m functions including, but not limited to, increasing activity and stability in non-aqueous solvents, 
jj" Some examples of generating libraries of enzymes are shown in U.S. Patent No. 6,218,163, WO 
a r 99/67420, and WO 00/01842. U.S. Patent No. 6,218,163, WO 99/67420, and WO 00/01842 are 
flj hereby incorporated by reference in its entirety. 

33 In combination with the isolation of new enzymes, engineering of natural enzymes to 

- A - perform new functions has increased. Enzymes may be altered to perform a new and specific 
function as well as to enhance or remove an existing function. Rational protein engineering 
based on the site-directed mutagenesis of individual amino acids can be applied to achieve this 
goal. However, this requires a detailed knowledge about the structure-function relationship of 
20 the enzyme, which is seldom available and time consuming to obtain. In addition, the mutation 
of key residues may dramatically affect the stability of the protein. The application of rational 
protein design can be time-consuming and expensive. Moreover, screening of libraries 
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containing numerous mutant enzymes to characterize modified catalytic functions and to 
determine optimal catalytic conditions is also laborious and expensive. 
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Summary of the Invention 

Traditional high-throughput screening involves a time-consuming manual survey of 
many different reaction conditions with the examination of one variable at a time. Automated 
technology is much more efficient, allowing scientists to examine different reaction conditions 
5 simultaneously while working at a very small scale. Instead of producing chemical libraries, the 
output from APR provides libraries of assay conditions. Researchers can rapidly examine 
multiple variables and their interactions in a defined series of reactions to minimize the number 
of experiments necessary for optimization of the assay conditions. APR merges the automated 
capability and the small scale of combinatorial chemistry with statistical design of experiments 

!"|0 (DoE). This technology can be applicable to the field of biocatalysis in several ways. APR 
greatly decreases the amount of time required for the high-throughput screening of mutant 

Mi libraries and is also a valuable tool for designing and carrying out a finite number of experiments 

^ to optimize the performance of a biocatalyst. 
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Brief Description of the Drawings 

The following discussion will make reference to the accompanying drawing figures, 
wherein like reference numerals refer to like elements in the various views, and wherein: 

Figure 1 is a diagram of the components of a preferred workstation for implementing the 
5 invention. 

Figure 2 is a block diagram illustrating the flow of commands and data between the 
computer and synthesizer, robotic arm and product analyzer of Figure 1 . 

Figure 3 is flow chart illustrating the sequence of steps in performing the preferred 
optimization routine using the equipment of Figure 1. 

Jjp Figure 4 is an additional block diagram of the computer, synthesizer, robot, and analyzer. 

| ; U Figures 5A-5G are an additional flow chart of the sequence of steps in performing the 

?w preferred chemical reaction optimization routine using the equipment of Figure 1 . 

w £ Figure 6 is an example of a chemical equation using E009 enzyme. 

y Figure 7 is a 3 -dimensional contour plot of the conversion response with E009 enzyme 

J45 being constant at 5 mg/2 mL. 

12 Figure 8 is a 3 -dimensional contour plot of the conversion response with Phenylethyl 

Acetate being constant at 37.5 mM. 

Figure 9 is a conversion contour plot with E009 enzyme being constant at 5 mg/2 mL. 
Figure 10 is a conversion contour plot with the percentage of ACN being held constant at 



-6- 

MCDONNELL BOEHNEN HULBERT & BERGHOFF 
300 South Wacker Drive, 32 nd Floor 
Chicago, Illinois 60606 
(312) 913-0001 



X 1 

Detailed Description of the Presently 
Preferred Embodiments 

A different approach, known as directed evolution, can give rapid access to an enzyme 

5 with the desired properties. This process utilizes a variety of methods such as sequential random 

mutagenesis, error-prone mutagenesis or gene shuffling in combination -with high-throughput 

screening or selection to identify libraries of potential biocatalysts. Directed evolution does not 

require any prior knowledge of the stracture-function relationship. The ultimate goal for this 

process is the production of a catalyst that enhances or removes a defined application-specific 

10 function of the natural enzyme. Examples of this function include a mutant protein with 

f=s increased activity, greater stability in organic solvent, or broader substrate specificity with 

;S respect to the native enzyme. An important element to a successful directed evolution program 

H is the high throughput screening assay. The identification of the best suited mutant protein is 

^ highly dependent on the selected screening conditions. 

ji?issi 

The major steps of directed evolution include the selection of the gene, the creation of a 
2 variant library, insertion of the library into an expression vector, expression of the gene library to 

.MISS. 

l! produce the mutant enzyme libraries, screening of the mutant enzymes for the property of 

interest, and the isolation of the gene corresponding to the improved variant properties so that the 

cycle can be repeated several times. The generation and screening of mutants with improved 

20 performance is carried out in iterative steps. After several cycles, the performance of mutant 

proteins should be optimal under the application-specific conditions. 

The high-throughput screening assay must be designed to test the new or modified 

function of the biocatalysts. High-throughput screening may consist of a survival, heat 

evolution, or colorimetric assays so that a positive result is observed. The identification of the 

25 best biocatalyst is highly dependent on the screening conditions. 
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Biocatalysts may be applied to the chemical synthesis of fine chemicals. Advantages of 
biocatalysts over conventional synthetic methods include a decrease in resource efficiency as 
well as the reduction of waste. Biocatalysts offer the potential to reduce the unwanted 
byproducts that are generated by chemical synthesis. The use of biocatalysts with increased 
5 stability or catalytic activity would replace chemical reaction that are difficult to achieve with 
standard methods and eliminates high volumes of hazardous waste. 

Directed evolution is severely limited by both the time involved in screening mutant 
libraries for a specific function and in the optimization of the assay conditions for a biocatalyst 
High-throughput screening of mutant libraries may involve hundreds of substrates. Additional 
So time may be required to synthesize substrates that are not commercially available. When a 
fU positive result is observed for the screening assay, multiple variables must be examined to 
W determine the conditions for optimum enzyme activity. The variables that may affect enzyme 
activity are numerous and not all will significantly contribute to the desired function. The 
K stability of the protein may be an issue when an organic co-solvent is necessary or if the 
f|5 production of a biocatalyst with activity in organic solvent is the desired goal. Examples of 
W solvents include MeCN, MeOH, EtOH, DMF, H 2 0 ? aqueous buffers and mixtures thereof Each 
substrate that exhibits a positive screening result must be explored in greater detail to determine 
the conditions for optimum enzyme activity. 

Thus, Automated Process Research (APR) technology is a powerful tool for the 
20 pharmaceutical industry to generate lead compounds as potential drug candidates. This 
technology differs from traditional methods by utilizing automated equipment coupled with the 
statistical design of experiments to rapidly synthesize diverse libraries of chemical compounds. 
In the past, automated technology has been confined to defined areas of chemical synthesis. 

-8- 

McDonnell Boehnen Hulbert & Berghoff 
300 South Wacker Drive, 32 nd Floor 
Chicago, Illinois 60606 
(312)913-0001 



However, APR technology has unlimited potential and can be applied to any repetitive task that 
is performed in the laboratory. 



1. APR and high-throughput screening of mutant libraries. 

5 The high-throughput screening of substrates for improved function may potentially 

require hundreds of reaction assays. The automated equipment may be of great use in facilitating 
this screening process. There may be various methods for the identification of a positive assay 
result, depending on the function of the protein that is being enhanced or removed. One method 
of detection is a colorimetric assay. The chromophoric properties of the substrate or the presence 

13 

^§0 of color indicators would allow the detection of a positive result. In some cases, a visible 

\Q 

-1 screening assay may not be possible, depending on the function of the mutant library or the 
ll nature of the substrate. The high-throughput robotics of the APR equipment would be an 
s' efficient way to analyze the product formation by chromatography. In addition, APR can 
M significantly reduce the time and costs associated with the traditional screening of substrates, 
yjjs The statistical design of experiments may aid in the screening of mutant libraries. For 

example, the goal of directed evolution might be to generate a protease that exhibits increased 
activity and stability in organic solvent. A finite number of experiments can be designed at 
several concentrations of organic solvent to determine the effect of organic solvent on enzyme 



activity. 

20 2. Rapid optimization of biocatalyst performance. 

Once positive results have been obtained from the high-throughput screening 
experiments, the optimization of the assay conditions for each of the positive results will be 
necessary. The optimization of biocatalyst performance can be carried out by APR, combining 
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the use of automated equipment with the statistical design of experiments. Enzyme activity is 
affected by numerous factors including the concentrations or presence of substrates, activators, 
inhibitors or co-factors as well as the effects of salts, buffers, pH, temperature, solvent 
composition, ionic strength, and perhaps, the interactions with other proteins. The nature of the 
5 buffer may be important since some buffers can act as inhibitors of the enzymes. Salts may be 
essential for activity, especially with enzymes from extreme environments such as those found 
in thermophilic or halophilic bacteria. Metal ions essential for activity may be complexed by 
the buffer. To achieve maximum catalytic activity, one or more of these variables should be 
optimized, preferably all. 

%0 Three basic experimental designs can be utilized to enhance the biocatalyst activity. The 

goal of the first experimental plan would be to determine which experimental parameters, or 
|i variables, contribute to the optimization of enzyme activity. This can be accomplished with a 
■ Plackett-Burman design. This model calculates a finite number of experiments to screen all 
W variables that are entered over user-defined ranges. The statistical design is coupled with the 
5 automated equipment to carry out the experiments and the results may be plotted, for example, 
r " in a bar graph format. Many variables may be statistically insignificant within the user-selected 
ranges and can be held constant in future experimental designs. The Plackett-Burman design 
does not provide information about the optimum assay conditions, but instead determines the 
parameters that are critical for enzyme activity. In the second experimental plan, a fractional 
20 factorial design is implemented to provide a rough estimate of the optimum conditions for 
enzyme activity. The parameters that were insignificant within experimental plan 1 are held 
constant and the others are varied over a broad range. The results may be displayed, for 
example, in a table format. A more accurate catalytic performance enhancement can be 
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determined with a third experimental plan, the surface response design. The parameters that are 
less significant to achieve optimum activity in the second experimental plan are held constant so 
that the more critical variables can be optimized. 

In some cases, the substrates that may be desired for the high-throughput screening 
5 experiments may not be commercially available. APR technology may then be applied to 
chemical process development for the rapid optimization of the desired substrate. The use of 
synthetic substrates for screening libraries of mutant proteins may be necessary. For example, 
APR can be utilized to examine the effect of carbon chain length on the enzyme activity. 

The use of statistically designed experiments coupled with automated equipment can 
Jo greatly facilitate the identification of enhanced biocatalytic performance. The examination of 

PI; 

F U one variable at a time would require a vast number of experiments and the optimum assay 
IU conditions would not be easily obtained. This process would then be repeated for each substrate 
= P that exhibited a positive result within the high-throughput screening. APR can greatly reduce 
S the time and cost involved in this process. Whether screening mutant libraries for activity, 

05 optimizing assay conditions, applying chemical process development, or reformulating drugs, 
U APR identifies crucial parameters. By employing DoE, cost consequence can be identified for 

process variables. Automated process research is a valuable business/technology platform with 

many applications. 

3. System Overview 

20 In this description, the novel application of automated technology to optimization of 

biocatalyst performance is disclosed. The basic concept is to have a machine perform the 
repetitive procedures involved in process development in order to increase the efficiency with 
which data can be collected and analyzed for a given biocatalyst, 
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A preferred workstation for implementing the invention is shown in Figure 1. The 
workstation 10 includes a synthesizer 12 having a reaction block 14 having, for example, 96 
reaction wells 16. In experiments which require adjustment of the temperature, the synthesizer 
12 may be equipped with a temperature control system 18 for adjusting the temperature of the 
5 block 14, so as to control the temperature of the wells 16. Preferably, the temperature control 
system 18 has the capability of controlling the temperatures of the wells individually, so that the 
conditions in the wells 16 can be customized. The synthesizer may include a lid or cover 20. In 
one embodiment, a source 22 of nitrogen or argon gas or any appropriate gas mixture may be 
connected to the synthesizer 12 via a conduit 24, if necessary, which enables a control of the 
*|0 atmospheric conditions above the wells. Mixing mechanisms such as a vortex mixer or an 
m orbital shaker can be built into the synthesizer 12 to assist in the mixing of the reagents in the 
fy wells. 

The synthesizer 12 further includes a robotic arm assembly 26 which has pipetting 
K capability for selectively adding quantities of one or more reagents to the wells 16. The robotic 
jfSjs arm assembly 26 includes an X-Y drive mechanism 28 or other suitable means for controlling 
ii the position of the pipetting tip portion 30 of the arm assembly relative to the wells. 

In one embodiment, the analytical functions are manually performed by the operator by 
examining the samples (e.g., if the pH is the subject of analysis and if a chemical is added to the 
wells to indicate the pH of the reaction based on the color of the well, the pH may be manually 
20 determined by visually examining the color change of the well). Alternatively, the station 10 
may includes an analytical instrument 40 for conducting an analysis of the reaction. The 
analytical instrument 40 varies based on the requirements of the analysis. In one embodiment, 
an HPLC machine (for example, a chiral HPLC machnine) is used to examine at least one aspect 

-12- 



McDonnell Boehnen Hulbert & Berghoff 
300 South Wacker Drive, 32 nd Floor 
Chicago, Illinois 60606 
(312)913-0001 



of the reactions in the wells. This one aspect may include the product of the reaction, the by- 
products of the reaction, the unreacted substrate, and the enantiomer selectivity. 

The products from the synthesizer 12 can be either manually loaded into the analytical 
instrument 40, or loaded automatically with the assistance of suitable robotic arms or other 
5 equipment, represented by robot 50 in Figure 2 or other suitable mechanical system. 

The operation of the synthesizer 12 and analytical instrument 40 may be controlled by a 
computer 42, as shown in the block diagram of Figure 2. The computer 42 regulates the 
environmental conditions in the synthesizer 12 such as by controlling the temperature of the 
wells 16. The quantity and type of reagents added to the wells are also controlled by the 
S) computer 42, as is the position of the arm 26 relative to the wells 16. The computer 42 further 
IU initiates and controls the analysis of the reaction in the analytical instrument 40, and receives the 
^ analytical data from the instrument 40. The computer 42 further implements a design of 
experiment program (DoE) that is used to identify the optimal conditions for the reaction being 
J5 studied, as described below. It will be understood that some or all of the control functions of the 
0 computer 42 may be integrated into one or more of the individual components of the system 10. 
H Where the products are automatically loaded into the product analyzer 40, the computer 42 
controls a robot 50 to perform this task. 

An additional block diagram of the computer, synthesizer, robot, and analyzer is shown 
in Figure 4. The computer 42 contains a processor 64 which communicates with non-volatile 
20 (read only memory, ROM 68) and volatile (random access memory, RAM 70) memory devices. 
The processor 64 also has a comparator 66 for comparing values. The processor 64 executes a 
computer program. The computer program is stored in the ROM 70 and executed either in the 
RAM 68 or the ROM 70. 

-13- 

McDonnell Boehnen Hulbert & Berghoff 
300 South Wacker Drive, 32 nd Floor 
Chicago, Illinois 60606 
(312)913-0001 



The processor 64 communicates with various subcomponents of the synthesizer 12, the 
analyzer 40 and the robot 50. The synthesizer contains a temperature control system 18 which 
controls the temperature of each of the individual wells of the block. The processor sends a 
command to the temperature control system 18 specifying a certain temperature for a particular 
5 well. The synthesizer also contains an agitator/mixer 76 which agitates or mixes the individual 
wells. There are two different methods of agitating or mixing. The first method is to agitate the 
block as a whole whereby each of the wells are shaken at the same rate. To do this, the entire 
reaction block is agitated at one rate. The second method is to mix each of the individual wells at 
different rates. Each well is equipped with at metal stirrer underneath the well. Inside the well is 
% ^0 a TEFLON-coated magnet which follows the motion of the metal stirrer underneath the well. In 
^ this manner, the individual well is stirred based on the rate at which the metal stirrer is rotated. 
|,I The rate of rotation is set by the processor 64. 

s' The synthesizer also contains an atmospheric regulator 78 which protects the components 

|9 in the wells if the components are sensitive to oxygen, carbon monoxide, or other materials in the 
5|5 environment in proximity to the well. Nitrogen, argon gas or any appropriate gas may be 
*"* dispensed from the source 22 through the conduit 24 based on a valve which is controlled by the 

valve motor 80. The valve motor is controlled by the processor 64. 

The synthesizer further contains a drive 28 for moving the robotic arm assembly 26. As 

described above, the robotic arm assembly 26 has pipetting capability for selecting, obtaining 
20 and dispensing one or more reagents. The pipetting capability is performed through a pipetting 

mechanism 74 which draws reagents through the pipetting tip portion 30 and stores one or more 

reagents in the robotic arm assembly 26. Subsequently, the one or more reagents are dispensed 
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via the pipetting mechanism 74 into the wells. Both the drive 28 and the pipetting mechanism 74 
are controlled by the processor 64. 

The analyzer 40 and robot 50 are in communication with the processor 64 as well. The 
processor 64 controls the drive 72 of the robot 50 which extracts samples from each of the wells. 
5 The samples are transferred to the analyzer 40 which analyzes one aspect of the mixture 
including the product of the reaction, the by-products of the reaction, the unreacted substrate, 
and/or the enantiomer selectivity. 
4. Detailed Methodology 

Referring to Figure 3, there is shown a block diagram of the methodology for the reaction 
#0 optimization and will be described in conjunction with the system in Figure 1. As shown at 
f y block 52, the reagents are dispensed in the wells of the synthesizer. 

S'« Variables which may be varied in a set of experiments include, but are not limited to: 

types of enzymes; amounts of enzymes; types of solvents/buffers; amounts of solvents/buffers; 
m temperature; pressure; pH; types of substrates; time; enzyme - substrate ratio; and agitation 
05 (whether to agitate and the speed of agitation). The values for the first set of experiments (see 
!M block 49 of Figure 3) may be chosen through a variety of ways. For example, the values may be 
chosen manually by the operator. Alternatively, a range of values for the variables may be 
chosen and the values for each of the 96 initial experiments may be chosen randomly or 
periodically with the range of values available. For example, given the number of enzymes 
20 available for testing, the range of temperatures, the types of solvents/buffers, amounts of 
solvents/buffers, the substrates available, etc., an initial set of experiments maybe chosen. 

After the values for the initial experiments are chosen, the experiments are run. In Steps 
1-3 (blocks 52, 54 and 54 of Figure 3), the synthesizer 12 containing a 96-well reaction block 14 
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is used for the reaction of interest, and the robotic arm 26 can be programmed to dispense precise 
amounts of reagents into each well (see Figure 1). Each well 16 contains a separate experiment. 
Depending on the environmental conditions necessary for the experiments 
(temperature/pressure/etc.), the environmental conditions within each well can be controlled and 
5 the contents of each well can be efficiently mixed. The reagents can then be quenched and 
worked up using the same robotic technology, as shown at step 3 (block 56) of Figure 3. For 
example, if the reagents require a particular temperature, pressure or time period, the computer 
sends this information to the synthesizer to create the predetermined conditions for the 
experiment. This process alleviates the operator from performing repetitive tasks and increases 
CO the efficiency with which information can be gathered. 

W Once the 96 reactions are completed, at step 4, as shown at block 58 of Figure 3, the tasks 

I y of compound analysis and data compilation begin. The analysis, as discussed previously, may be 
T manual or automated. Manually, the operator may examine the reaction wells for a particular 
m characteristic. For example, the operator may examine the color of the well to determine the 
□5 extent of the reaction by examining a change in pH. The operator may then rate each of the 
M- samples as "good" or "bad" or rank them in order from "best" to "worst." Alternatively, the 
analyzer 40 may automatically examine the samples. The success of each of these 96 reactions 
can be evaluated using the analytical techniques which was already developed for the parent 
reaction. For example, HPLC (High Pressure Liquid Chromatography), and in particular chiral 
20 HPLC, might be the analytical method of choice. In this case, the reactions would be manually or 
automatically transferred to vials which fit in an HPLC autosampler 40. Alternatively, an 
LC/MS machine or a gas chromatography machine may be used. The reactions may be analyzed 
for at least one component including, for example, the products of the reaction, the by-products 
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X 1 

of the reaction, the unreacted substrate, and the enantiomer selectivity (whether (R) or (S) 
enantiomer). 

At this point (step 5), the results are compiled and analyzed. In one embodiment, in order 
to design the next set of experiments, the data compiled in step 4 is analyzed to determine 
5 common characteristics of a desired response. The analysis, in a preferred embodiment, is 
performed automatically by the computer 42. The analysis may include a two-step process: (1) 
determining the "success" or "failure" of each of the experiments; and (2) determining common 
characteristics of the "successful" experiments. For example, if the desired outcome of the 
experiments is enantiomeric selectivity (either (R) or (S)), the wells are examined to determine 
.Jo the reactions that resulted in good enantiomer selectivity. In addition, if other desired outcomes 
rU are of interest as well, such as product yield or time of reaction, the determination of whether an 
- ry experiment is a "success" may account for those variable(s) as well. 

The analysis may then focus on determining the common characteristics of the 
S experiments which were deemed "successful" (such as desired enantiomer selectivity). For 
jf 5 example, the analysis may determine common traits for the input variables for the "successful" 
U experiments, such as temperature, pressure, agitation, solvent, amount of catalyst, substrates, 
buffer, salts, pH, ionic strength, activators, inhibitors, etc.). These common traits may specify a 
certain temperature range, pressure range, etc. for the "successful" experiments. Alternatively, 
the analysis may focus on a "best" experiment/experiments and analyze the input variables for 
20 the "best" experiment/experiments. 

Once a statistical analysis of the data is performed, the statistical analysis is then 
transferred (either manually or automatically) to a means for designing the next set of 
experiments (step 6). The designing of the next set of experiments depends on the statistical 
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analysis, as discussed more fully below. As merely one example, the next set of experiments 
may be chosen based on the common characteristics determined in step 5. Specifically, the next 
set of experiments may be chosen from the ranges of values (e.g. , temperature, pressure, amount 
of catalyst, etc.) that were determined from the "successful" experiments. The next set of 
5 experiments may be evenly distributed within the ranges of values determined in step 5. 
Alternatively, the next set of experiments may be randomly chosen within the range of values 
determined in step 5. The concept of statistical design of experiments (DOE) may therefore be 
applied to aid in experimental design. In an alternate embodiment, the next set of experiments 
may be designed around the "best" experiment wherein the input variables for the "best" 

IS) experiment may serve as the basis for the ranges of the input variables for the next set of 

7J experiments. 

fU Commercially available computer programs can control the reaction conditions utilized 

+ by the synthesizer, perform statistical analyses and design the next set of experiments to conduct 
y the most effective DOE study. One such program is Design Expert by Stat Ease Corp. in 
fh5 Minneapolis, Minnesota, which uses a linear regression analysis. Specifically, the computer 
il program can analyze the data obtained from the analyzer to generate common characteristics 
from the data (such as linear regression analysis), to determine a range of "good" samples. 
Alternatively, the computer program may determine the "best" (based on established criteria) 
sample, or to determine the "worst" sample. The computer 42 can then correlate the data 
20 obtained and extrapolate to propose new experiments. The system may then iterate to 
subsequently confirm the proposed optimal conditions. Specifically, the computer program can 
take the common characteristics generated from the statistical analysis and propose new 
experiments based on the trends. Alternatively, the computer program can design a new set of 
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experiments localized around the reagents/conditions of the "best" sample. This iteration is 
represented by the arrow 51 in Figure 3. Basically, a new and potentially more narrowly 
circumscribed set of parameters (including types of enzymes, amounts of enzymes, types of 
solvents, amounts of solvents, time of reaction, or environmental conditions) are programmed in 
5 the synthesizer and robotic arm, and the process is repeated. For example, the range of values for 
the parameters in the new set of experiments may be different (either narrower or a different 
range) from the previous range of values. This procedure could iterate several times, until the 
optimal reaction conditions are determined with the desired level of precision. Alternatively, the 
procedure (steps 1-5) could just be performed once, with the computer 42 identifying which of 
S) the reaction wells 16 had the most favorable conditions for the reactions. 

f y Several types of methodologies may be used to design the next set of experiments (see 

'"4 

W step 6 of Figure 3) including the Monte Carlo method, the SDO method, and the "weights" 
^ method. Using a program which utilizes the Monte Carlo method, for instance, the operator can 

rWiSS. 

It define the space of parameters to be analyzed, run a series of random preliminary experiments in 
jJ5 this space, define a new space of parameters using the best of these preUminary experiments, run 
U additional experiments in the new space and continue this process until no further improvement 
is observed. For example, the operator defines a space of reaction parameters for each 
experiment, such as temperature, pressure, agitation, solvent (e.g., organic, H 2 0, etc.), type of 
catalyst, amount of catalyst, substrates, buffer, salts, pH, ionic strength, activators, inhibitors, 
20 and/or time period for reaction, and then performs several preliminary random experiments using 
the synthesizer. The analyzer data concerning the product yield, by-products, unreacted 
substrate, enantomer selectivity, for instance, are then analyzed by the computer to determine the 
"successful" experiments. Based on the "successful" experiments, the program then utilizes the 
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statistical method to generate a new space of parameters (e.g., temperature, concentration, 
pressure and time) for further experimentation. This new space of parameters may be based on 
values for the parameters from the "successful" experiments (e.g., the new space for the 
temperature parameter may be based on the temperatures from the "successful" experiments). A 
5 new set of reactions are then performed based on the new space of parameters and the resulting 
data from the experiments for the new space of parameters is then stored and processed by the 
Monte Carlo method as before. This process can be repeated until no further improvements are 
obtained. 

Alternatively, a program which utilizes the SDO method generates a set of experiments in 
;|o all of the variables of interest for the operator. When these experiments have been run, the 
%] experiment that gave the worst result is identified among the set. This experiment is then 

If 3 s 

U discarded and replaced with a new experiment. When the replacement experiment has been run, 
= the worst of the set is again identified and discarded. This process continues until no further 

improvement is observed. For example, the operator performs preliminary experiments with the 
S 5 synthesizer using SDO variables of interest. The data, in combination with the variables, are then 
r analyzed by the program. The program would then eliminate the experiment with the worst result 
and generate a new proposed experiment. This process is repeated until no further improvements 
in product yield, for instance, are obtained. 

Another method to analyze the data in the newly created table is by first determining the 
20 "weights" for each of the reaction parameters. The reaction parameters may include temperature, 
pressure, agitation, solvent (e.g., organic, H 2 0, etc.), amount of catalyst, substrates, buffer, salts, 
pH, ionic strength, activators, inhibitors, and/or time period for reaction. Prior to execution of 
the program, the operator assigns "weights" based on importance of each reaction parameter. In 
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this manner, the results of each of the wells can be assigned a total "score" by multiplying the 
reaction parameters by the "weights" and adding them. Each of the results for an individual well 
can then be tallied. For parameters which are of great importance, those parameters are weighted 
accordingly. For parameters which are more desirable when they are lower in value, e.g. the 
5 time of reaction, the result of multiplying the weight by the parameter can be inverted, and then 
added to the total to determine the "score." 

The entries can then be arranged based on the score. The processor 64 then displays the 
results of the raw data and the "scores." At each step in the methodology, the display can be 
updated to inform the operator of the current reaction. For example, when the processor 64 

*jo commands or receives information from the synthesizer 12, the analyzer 40 or the robot 50, the 

lj display can be updated to indicate the current operation. 

[I Based on the highest ranked "score," the suggested bounds for the next set of 

7 experiments may be determined. For example, if the temperature of the reaction is determined to 
£3 be an important parameter, the temperature value of the highest ranked "score" is used as a base 
5|5 value for the temperature bounds for the next set of experiments. The suggested parameters are 
then displayed to the operator. 

This automated process development technology allows a vast array of data to be 
collected and interpreted. Many combinations of reaction variables can be investigated in a short 
time period. Using the current manual technology, only a local optimization is found because it 
20 is too time consuming to investigate every set of reaction conditions. With the new automated 
technology presented here, a large number of statistical data points can be collected. In essence, a 
global optimization is found. The amount of data generated by this process is limited only by the 
number of variables that can be envisioned for a given reaction. 
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Figures 5A-5G are an additional flow chart of the sequence of steps in performing the 
preferred reaction optimization routine. The program which executes the operation of the 
automated sequence of operations, as stated above, is resident either in RAM 68 or ROM 70. The 
program first determines the initial values of ingredient concentrations and type of ingredients 
5 for each of the wells 82. Different ingredients may include enzymes, substrates, co-factors, 
solvents, etc. This is done so that the processor 64 can command the pipetting mechanism 74 to 
obtain the correct ingredients and the approximate amount of ingredients for use in all of the 
wells. As shown in Figures 5A-5G, the total number of wells is designated as "X." As discussed 
above, one reaction block 14 has, for example, 48 reaction wells 16. Reaction blocks with less or 
JjO more reaction wells may be used as well. 

rU The processor 64 then instructs the drive 28 to a particular x and y position to obtain the 

fy ingredients 84. The pipetting mechanism 74 then stores the ingredients in the dispenser of the 
* drive of the synthesizer 12, as shown at block 86 of FIG. 5. Then a loop is executed for each of 
}i the wells 1 6, with the well_number set equal to 1, as shown at block 88 of FIG. 5. The processor 
f|5 64 moves the motor of the drive 28 to the x and y position of the well 90, the ingredient values 
U and type of ingredients are determined by the processor 92, and the ingredients are dispensed 
into the well, as shown at block 94 of FIG. 5. The ingredient values and types of ingredients are 
determined by a parameter look-up table 69 (which contains all of the relevant parameters for the 
experiment) in the memory of the microprocessor. The ingredient values and types of ingredients 
20 may be based either on operator input or based on the optimization scheme described 
subsequently. The well_number is incremented by 1, as shown at block 96 of FIG. 5. If the 
well_number is greater than the total number of wells (X), then the loop is exited, as shown at 
block 98 of FIG. 5. Otherwise, the flow chart of FIG. 5 goes to block 90. 
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Alternatively, the pipetting mechanism, rather than storing the ingredient in the dispenser 
in one step and dispensing in another step may alternatively store the ingredient and dispense, 
sequentially for each well. Further, rather than automatic obtaining and dispensing of the 
ingredient, the operator may manually input the ingredient values into the wells. 
5 Prior to execution of the program, an ingredient-properties look-up table is created which 

determines, for a specific ingredient, whether the ingredient is sensitive to any other items, such 
as oxygen or water. This ingredient-properties look-up table may be separate and distinct from 
the parameter look-up table 69, or may be combined for operator convenience. Based on the 
ingredient-properties look-up table, if the ingredient is sensitive to oxygen or water 100, the 
*|o processor 64 opens the valve motor 80 to dispense either nitrogen or argon gas as shown at block 
rU 102 of FIG. 5. The well_number is set equal to 1, as shown at block 104 of FIG. 5. Then, the 
W clock for the processor 64 is checked with the value stored as the startjime of the experiment 
^ 106. A loop is then entered to set the temperatures of each of the wells. The temperature is 
;i determined for each well 108 by the parameter look-up table 69. The temperature in the 
rt5 parameter look-up table 69 is either based on operator input or based on the optimization scheme 
U described subsequently. The processor 64 sends a command to the temperature control system 1 8 
to set the temperature value 1 10. The well_number is incremented by 1, as shown at block 1 12 of 
FIG. 5. If the well_number is greater than the total number of wells (X), then the loop is exited, 
as shown at block 1 14 of FIG. 5. Otherwise, the flow chart of FIG. 5 goes to block 108. 
20 The agitation/mixing of the synthesizer is next initialized based on whether the individual 

wells are mixed at different rates or whether the entire reaction block is agitated at the same rate, 
as shown at block 116 of FIG. 5. If the agitation is at the same rate, the program determines the 
block agitation from the parameter look-up table 1 18 and sends a command to the agitator/mixer 
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120. If the agitation is at different rates, the program enters a loop, with the well_number set 
equal to 1 as shown at block 122 of FIG. 5, and determines the agitation from the parameter 
look-up table for each well 124 and sends a command to the agitator/mixer 126. The 
well_number is incremented by 1, as shown at block 128 of FIG. 5. If the well_number is greater 
5 than the total number of wells (X), then the loop is exited, as shown at block 130 of FIG. 5. 
Otherwise, the flow chart of FIG. 5 goes to block 124. In addition to mixing, the pH (which may 
be one of the parameters) in the wells may be modified using a pH machine. 

The reaction times are then determined for each of the wells, as shown at block 132 of 
FIG. 5 based on data in the parameter look-up table 69. The wells are ordered in an array based 
J) on the reaction time, from lowest to highest with a pointer set to the first item in the array, as 
*L' shown at block 134 of FIG. 5. The reaction time is determined for the well which is at the 
pointer, as shown at block 136. The reaction times are then checked based on checking the clock 
i5 P from the processor 64 and subtracting the time from the start value 138. When the reaction time 
has been exceeded for a particular well, as shown at block 140, the reaction is stopped 142. 
1*5 Stopping the reaction can be done in several ways including removing denaturing the enzyme or 
jfi removing aliquots from the wells as discussed below. The pointer is set to the next item in the 
array, as shown at block 144. As shown at block 146, if the pointer is outside of the array, the 
flow chart goes to block 152. Otherwise, the flow chart goes to block 136. 

After the reaction, the components of each of the wells 16 can be removed from each of 
20 the wells, sent to the analyzer 40 and analyzed. The well_number is set equal to 1, as shown at 
block 152 of FIG. 5. The processor 64 signals the drive 72 of the robot 50 to move to an x and y 
position 154, extract mixture from the well 156, and send the mixture to the analyzer 158. The 
analyzer 40 then analyzes the reaction mixture for a predetermined criteria, as shown at block 
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160, and sends the results to the processor 64. Some examples of predetermined criteria include: 
smell; viscosity; texture; time of processing; temperature of processing; pressure of processing; 
cost of ingredients; presence of certain ingredients; amounts of certain ingredients, etc. In the 
perfume example, the analyzer may be an electronic nose. The electronic nose may analyze the 
5 headspace of a particular well and generate data associated with the "smell" the electronic nose 
senses. 

The results from the analyzer(s) may be sent to the computer for examination and 
comparison with a look-up table, as shown at block 162. For example, in the flavor component 
H context, if the analyzer registers the components of the well (ie., the types and amounts of 
*|0 components), the computer may take this data and compare it with a look-up table and assign a 
^ value to results. If certain components or certain amounts of components are desired, the 
|! f computer may assess a value, a determination of "good" or "bad," or some other evaluation of 
!U the well based on comparison of the components of the well with the values in the look-up table. 
U As another example, if a certain smell is desired, the headspace may be analyzed to determine its 
EJ5 components/amounts of components and compared with desired components/amounts of 
components. 

The processor stores the results in a table, as shown at block 164, and continues obtaining 
data for each of the wells. The well_number is incremented by 1, as shown at block 166 of FIG. 
5. If the well__number is greater than the total number of wells (X), then the loop is exited, as 
20 shown at block 168 of FIG. 5. Otherwise, the flow chart of FIG. 5 goes to block 154. 

As discussed previously, one method to analyze the data in the newly created table is by 
first determining the "weights" for each of the reaction parameters 172. The reaction parameters 
may include amount of ingredients, types of ingredients, etc. Prior to execution of the program, 
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the operator assigns "weights" based on importance of each reaction parameter. In this manner, 
the results of each of the wells can be assigned a total "score" by multiplying the reaction 
parameters by the "weights" and adding them. For example, if the types of ingredients and 
amount of ingredients are the two parameters of interest, and the type of ingredients is 
5 considered more important than the amount of ingredients (for example, if one of the ingredients 
in the perfume example is very expensive, it may factor in the analysis), the "weights" for each 
can be 0.8 and 0.2, respectively for each of the two parameters. Each of the results for an 
individual well can then be tallied 174. The well_number is set to 1, as shown at block 170. The 
^ well_number is incremented by 1, as shown at block 176 of FIG. 5. If the well_number is greater 
=Jp than the total number of wells (X), then the loop is exited, as shown at block 178 of FIG. 5. 
% 4 Otherwise, the flow chart of FIG. 5 goes to block 174. For parameters which are more desirable 
H when they are lower in value, the result of multiplying the weight by the parameter can be 
J,. inverted, and then added to the total to determine the "score," 

IZ The entries can then be arranged based on the score, as shown at block 180. The 

fft> processor 64 then displays the results of the raw data and the "scores," as shown at block 182. At 
each step in the methodology, the display can be updated to inform the operator of the current 
reaction. For example, when the processor 64 commands or receives information from the 
synthesizer 12, the analyzer 40 or the robot 50, the display can be updated to indicate the current 
operation. 

20 Based on the highest ranked "score," the suggested bounds for the next set of 

experiments are determined 184, 186. For example, if the amount of organic solvent, enzyme and 
substrate so that the enzyme - substrate ratio is as low as possible and the substrate concentration 
as high as possible, the parameters of the well with the highest ranked "score" may be used as a 
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base value for the temperature bounds for the next set of experiments. The suggested parameters 
are then displayed to the operator 188. 

Automated process research can greatly reduce the amount of time and cost involved in 
the production of new biocatalysts and the optimization of their performances. In addition, APR 
can provide statistical response curves so that the enzyme activities are optimized. These 
optimized conditions are not easily be found by examining one variable at a time. The use of 
APR technology could result in an expansion of the biocatalysis field due to a decrease in the 
time constraints involved in this process. 
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Example: Enantioselective hydrolysis of (±)-phenethyl acetate using ThermoCat E009. 

An enantioselective pair enzyme-substrate is selected as one example of the application 
of the DoE for purposes of process optimization. The enzyme is an esterase enzyme and the 
substrate is (±)-phenethyl acetate. The reaction that is being catalyzed is the Enantioselective 
5 hydrolysis of one enatiomer of the racemic acetate. Parameters to be optimized may include, for 
example, the amount of organic solvent, enzyme and substrate, so that the enzyme : substrate 
ratio is as low as possible and the substrate concentration as high as possible. The enzyme is 
enantioselective on all conditions tested as long as it keeps its activity. Referring to Figure 6, 
there is shown the chemical reaction for the current example, 
jf) The set up consists of variable amounts of KPi buffer pH =7.2 containing bromothymol 

|y blue as a pH indicator, MeCN as cosolvent, E009 esterase (lyophilized powder) and racemic 
Tl' phenethyl acetate. Stirring is provided by a tumble stirring system that allows uniform mixing on 
"h a plate holding 8x12 glass vials as microreactors. pH is maintained around neutral by adding 
2 base upon color change in the reaction solution. 

Jfi> The first DoE contained 1 1 experiments involving 3 variables and 2 levels (the variable is 

|I given a high and a low value to evaluate the impact). The results indicated MeCN is the most 
critical factor, and a second DoE was set up using 14 experimental conditions to narrow down 
the MeCN variable (augmented design), involving again 3 variables and 2 levels. A third round 
of DoE (model-robust) contained 10 experiments involving the 3 variables and 2 levels, and a 
20 last DoE was performed (central-composite) again with 3 variables, 2 levels and 17 experiments. 
A total of 52 experiments were set up during the three rounds of Automated Process Research. 
Table 1 shows the experimental breakdown, 3 variables and 1 outcome. 
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Table 1 

First Second 



Exp 


MeCN 


E009 


[Sub] 


conv. 


Exp 


MeCN 


E009 


[Sub] 


conv. 


# 


(%) 


(mfl) 


(mM) 


(%) 


# 


(%) 


(ma) 


(mM) 


(%) 


1 


5 


30 


200 


70 


1 


1 


5 


5 


67 


2 


5 


5 


5 


64.7 


2 


12 


30 


5 


88.9 


3 


50 


30 


5 


0.7 


3 


4 


24 


54 


86.9 


4 


50 


30 


200 


0.1 


4 


9 


11 


151 


60.8 


5 


50 


5 


5 


2 


5 


7 


18 


103 


71.2 


6 


5 


30 


5 


100 


6 


12 


30 


5 


100 


7 


50 


5 


200 


0 


7 


1 


18 


200 


71.9 


8 


5 


5 


200 


32.3 


8 


7 


18 


103 


71.9 


9 


28 


18 


103 


0 


9 


4 


24 


151 


84.6 


10 


28 


18 


103 


1.6 


10 


1 


5 


5 


64 


11 


28 


18 


103 


0 














Second augmented 








Third 









Exp 


MeCN 


E009 


[Sub] 


conv. 


Exp 


MeCN 


E009 


[Sub] 


conv. 




# 


(%) 


(mq) 


(mM) 


(%) 


# 


(%) 


(mp) 


(mM) 


(%) 




1 


12 


5 


103 


32.7 


1 


20 


20 


40 


3.9 




2 


9 


24 


54 


66.2 


2 


12 


30 


38 


7.7 




3 


7 


18 


103 


69.8 


3 


12 


30 


38 


7 




4 


1 


30 


200 


81.8 


4 


12 


30 


38 


7.5 




5 


9 


24 


151 


7.3 


5 


20 


40 


5 


4.3 




6 


7 


18 


103 


66.2 


6 


20 


30 


38 


8 




7 


7 


5 


200 


9.5 


7 


4 


40 


5 


100 


issss 


8 


12 


5 


5 


14.5 


8 


12 


30 


70 


94.2 




9 


1 


5 


103 


38.9 


9 


4 


20 


70 


78.2 




10 


12 


30 


200 


5.6 


10 


4 


40 


70 


93 




11 


1 


30 


5 


93.5 


11 


4 


30 


38 


86.5 




12 


12 


18 


200 


6.3 


12 


4 


20 


5 


34.1 




13 


7 


5 


200 


7.9 


13 


12 


20 


38 


6.9 




14 


1 


30 


5 


95.3 


14 


20 


20 


5 


13 














15 


20 


40 


70 


2.9 














16 


12 


30 


5 


19.3 














17 


12 


40 


38 


19.6 



5 Referring to Figure 7, there is shown a surface graph for two variables while one is fixed. In one 



case, the enzyme amount was fixed, in the other, substrate concentration. The first plot is the 
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total screening space covered, while the second limits the amount of MeCN. One question for 
analysis is the optimum ratio enzyme-substrate. Since the enzyme is very enantioselective, the 
50% conversion mark indicates where that optimum would be. From analysis of the results, the 
optimum appears to be 150 mM, 5 mg/2 mL of enzyme and 5-10% MeCN. 
5 Referring to Figure 8, there is shown the results of the second DoE or augmented design, 

and is a closer look at the response surface. 

An alternative way of examining the data is the contour plots, shown in Figures 9 and 10, 
confronting enzyme and substrate at 5 and 6% MeCN. The area designated by "Y" is the target 
(50% conv) and the extrapolation indicates where to design the next set of experiments, which 
S) may use the ratio E/S as a variable rather than both parameters separately, 
ry If the enzyme involved in the biotransformation needs to be improved in terms of 

fU enantioselectivity, the above method can be used to examine the enantioselectivity of the 

.7 .. 

!B F reaction around 50% conversion as the outcome, rather than the conversion itself. 

It is intended that the foregoing detailed description be regarded as illustrative rather than 
ft limiting and that it is understood that the following claims, including all equivalents, are intended 
li to define the scope of the invention. 
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